OpenCoS: Contrastive Semi-supervised Learning for Handling Open-Set Unlabeled Data

نویسندگان

چکیده

Semi-supervised learning (SSL) has been a powerful strategy to incorporate few labels in better representations. In this paper, we focus on practical scenario that one aims apply SSL when unlabeled data may contain out-of-class samples - those cannot have one-hot encoded from closed-set of classes label data, i.e., the is an open-set. Specifically, introduce OpenCoS, simple framework for handling realistic semi-supervised based upon recent self-supervised visual representation learning. We first observe open-set dataset can be identified effectively via contrastive Then, OpenCoS utilizes information overcome failure modes existing state-of-the-art methods, by utilizing pseudo-labels and soft-labels in- respectively. Our extensive experimental results show effectiveness under presence samples, fixing up methods suitable diverse scenarios involving data.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Estimate Unlabeled-Data-Distribution for Semi-supervised PU Learning

Traditional supervised classifiers use only labeled data (features/label pairs) as the training set, while the unlabeled data is used as the testing set. In practice, it is often the case that the labeled data is hard to obtain and the unlabeled data contains the instances that belong to the predefined class beyond the labeled data categories. This problem has been widely studied in recent year...

متن کامل

Using Unlabeled Data for Supervised Learning

Many classification problems have the property that the only costly part of obtaining examples is the class label. This paper suggests a simple method for using distribution information contained in unlabeled examples to augment labeled examples in a supervised training framework. Empirical tests show that the technique described in this paper can significantly improve the accuracy of a supervi...

متن کامل

Semi-Supervised Support Vector Machines for Unlabeled Data Classification

A concave minimization approach is proposed for classifying unlabeled data based on the following ideas: (i) A small representative percentage (5% to 10%) of the unlabeled data is chosen by a clustering algorithm and given to an expert or oracle to label. (ii) A linear support vector machine is trained using the small labeled sample while simultaneously assigning the remaining bulk of the unlab...

متن کامل

Enhancing Supervised Learning with Unlabeled Data

In many practical learning scenarios, there is a small amount of labeled data along with a large pool of unlabeled data. Many supervised learning algorithms have been developed and extensively studied. We present a new \co-training" strategy for using un-labeled data to improve the performance of standard supervised learning algorithms. Unlike much of the prior work, such as the co-training pro...

متن کامل

Semi-supervised learning with committees: exploiting unlabeled data using ensemble learning algorithms

Supervised machine learning is a branch of artificial intelligence concerned with learning computer programs to automatically improve with experience through knowledge extraction from examples. It builds predictive models from labeled data. Such learning approaches are useful for many interesting real-world applications, but are particularly useful for tasks involving the automatic categorizati...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2023

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-031-25063-7_9